Develop a fault-tolerant controller for a quadcopter using model-based reinforcement learning.
Unmanned aerial vehicles (UAVs) such as quadcopters are now widely adopted for a variety of applications. In operation, quadcopters are exposed to disturbances such as wind, rain, and dust, which can cause component faults. A fault is a change in a system's properties or parameters that causes the system to deviate from its designed behavior. A fault-tolerant controller (FTC) is a control strategy that aims to maintain acceptable performance when the system is degraded by a fault [1].
Deploying multi-rotor drones for applications such as Urban Air Mobility (UAM) and product delivery requires proper behavior of the vehicle at all times to ensure the safety of the environment and people nearby. An FTC is therefore a necessary component of such systems, and both academic researchers and industry professionals are working to improve methods that detect faults and provide a control strategy capable of overcoming them while ensuring safe behavior.
FTCs are classified as model-based or data-driven according to how the controller is developed. Model-based techniques require knowledge of the system's model and parameters to design the fault-tolerant controller. Data-driven approaches, in contrast, learn the FTC directly from system data. The fundamental problem with model-based FTC approaches is that their effectiveness depends on the accuracy of the system model, which is difficult to maintain when system parameters vary due to faults; furthermore, complex systems demand complicated controllers, which affects the controllers' robustness. Because data-driven techniques design the FTC from data without requiring the full dynamics of the system, data-driven methods, particularly reinforcement learning (RL)-based techniques, have recently attracted the attention of many researchers.
Train an RL agent to develop a fault-tolerant controller for a quadcopter using model-based reinforcement learning. The framework uses the system dynamics and a Kalman filter-based estimator to estimate fault-related parameters online and thereby detect the occurrence of a fault. Once a fault is identified, use the estimated fault-related parameters to train an RL agent that tunes the position and attitude controller gains of the quadcopter to compensate for the fault.
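As a concrete illustration of the detection step, a minimal sketch might flag a fault when the online estimate of a fault-related parameter (here, the motors' equivalent resistance, as in [2]) drifts outside a tolerance band around its nominal value. The numeric values and the functions `estimateFaultParameter` and `retuneControllers` below are hypothetical placeholders, not part of the project statement.

```matlab
% Minimal fault-detection sketch (assumed names and values): declare a fault
% when the estimated motor equivalent resistance Rhat drifts from nominal.
R_nominal = 0.12;          % nominal motor resistance [ohm] (illustrative)
tolerance = 0.25;          % relative deviation treated as a fault

Rhat = estimateFaultParameter();   % hypothetical call to the online estimator

relDeviation  = abs(Rhat - R_nominal) / R_nominal;
faultDetected = relDeviation > tolerance;

if faultDetected
    % Hand the estimated parameter to the RL-based gain-tuning stage
    retuneControllers(Rhat);       % hypothetical downstream step
end
```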
Suggested steps:
- Review the Tune PI Controller using Reinforcement Learning example to learn how to use the Reinforcement Learning Toolbox to tune a PI controller for a system.
- Review the Quadcopter Drone Model in Simscape example, which contains a detailed model of a quadcopter including the airframe, battery, and propulsion systems, and learn how PID control can be applied for a quadcopter's position and attitude control.
- Design a reward function for training the RL agent (consider a reward function that takes into account the error between the reference and actual trajectory; a minimal sketch appears after this list). To represent faulty behavior of the system, you may use the equivalent resistance of the motors as in [2].
- Use the simulation environment to simulate faulty behaviors and train an RL agent to tune the quadcopter's position and attitude PID controller gains (see the training sketch after this list).
- Apply the trained agent to tune the PID controllers in the presence of a fault or faults.
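One possible reward for the gain-tuning agent, sketched below, penalizes the squared tracking error between the reference and actual position plus a small penalty on control effort to discourage aggressive gains. The weights and signal names are illustrative assumptions and should be tuned for your model.

```matlab
function r = trajectoryReward(posRef, posActual, controlEffort)
% Reward sketch: negative weighted sum of squared tracking error and
% control effort. Weights are illustrative, not prescribed by the project.
    wErr    = 1.0;    % weight on position tracking error
    wEffort = 0.01;   % weight on control effort

    trackingErr = posRef - posActual;                 % 3x1 position error
    r = -(wErr * (trackingErr.' * trackingErr) ...
          + wEffort * (controlEffort.' * controlEffort));
end
```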
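For the training step, a minimal Reinforcement Learning Toolbox setup might look like the following. It assumes a Simulink model named `quadcopterFTC` containing an RL Agent block whose observations are the estimated fault parameter and tracking errors, and whose actions are the PID gains; the model name, block path, signal dimensions, and training thresholds are all assumptions.

```matlab
% Minimal RL training sketch (Reinforcement Learning Toolbox).
% Assumed: a Simulink model 'quadcopterFTC' with an RL Agent block that
% observes [fault parameter; tracking errors] and outputs PID gains.
mdl      = 'quadcopterFTC';
agentBlk = [mdl '/RL Agent'];

obsInfo = rlNumericSpec([7 1]);                    % e.g. Rhat + 6 error signals
actInfo = rlNumericSpec([6 1], ...                 % e.g. position/attitude gains
    'LowerLimit', 0, 'UpperLimit', 10);

env   = rlSimulinkEnv(mdl, agentBlk, obsInfo, actInfo);
agent = rlDDPGAgent(obsInfo, actInfo);             % default DDPG agent

trainOpts = rlTrainingOptions( ...
    'MaxEpisodes', 500, ...
    'MaxStepsPerEpisode', 1000, ...
    'StopTrainingCriteria', 'AverageReward', ...
    'StopTrainingValue', -10);                     % illustrative threshold

trainingStats = train(agent, env, trainOpts);
```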
Advanced Project work:
- Implement a state estimator to monitor the fault-related parameters used for training the RL agent (you may refer to the Fault Detection Using an Extended Kalman Filter example, [2], and [3]; see the estimator sketch after this list).
- Consider complete failure of a sub-component instead of a fault (degradation).
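For the estimator, the `extendedKalmanFilter` object used in the Fault Detection Using an Extended Kalman Filter example can track a fault parameter by augmenting the state vector with it. The sketch below assumes hypothetical `quadStateFcn`/`quadMeasFcn` functions in which the last state is the motor equivalent resistance, modeled as a random walk; the dimensions, noise levels, and logged-data names are assumptions.

```matlab
% EKF sketch for online fault-parameter estimation. Augmented state is
% x = [vehicle states; R], where R (motor equivalent resistance) follows a
% random walk. quadStateFcn/quadMeasFcn are hypothetical user functions.
x0  = [zeros(12,1); 0.12];                % initial states + nominal resistance
ekf = extendedKalmanFilter(@quadStateFcn, @quadMeasFcn, x0);
ekf.ProcessNoise     = blkdiag(1e-4*eye(12), 1e-6);  % small drift on R
ekf.MeasurementNoise = 1e-3*eye(6);

Rhat = zeros(1, numel(tMeas));            % tMeas/yMeas: logged measurements
for k = 1:numel(tMeas)
    predict(ekf);                         % time update through quadStateFcn
    correct(ekf, yMeas(:,k));             % measurement update
    Rhat(k) = ekf.State(end);             % estimated fault parameter
end
```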
Examples:
- PID Autotuning for UAV Quadcopter
- UAV Inflight Failure Recovery
- Simulink Drone Reference Application
- UAV Package Delivery
Suggested readings:
- [1] Blanke, M., Kinnaert, M., Lunze, J., Staroswiecki, M., and Schröder, J., Diagnosis and Fault-Tolerant Control, 2nd ed., Springer, 2006.
- [2] Bhan, L., Quinones-Grueiro, M., and Biswas, G., “Fault Tolerant Control combining Reinforcement Learning and Model-based Control,” 2021 5th International Conference on Control and Fault-Tolerant Systems (SysTol), pp. 31–36, 2021.
- [3] Daigle, M., Saha, B., and Goebel, K., “A Comparison of Filter-Based Approaches for Model-Based Prognostics,” 2012 IEEE Aerospace Conference, pp. 1–10, 2012.
Impact: Improve safety of multi-rotor drones.
Expertise gained: Drones, Artificial Intelligence, Robotics, Control, Reinforcement Learning, UAV
Degree level: Master's, Doctoral
Project number: 235